Brief Communication: Adjacency and proximity searching in the Science Citation Index and Google

Authors

  • Ronald N. Kostoff
  • John T. Rigsby
  • Ryan B. Barth
Abstract

We have developed simple algorithms that allow adjacency and proximity searching in Google and the Science Citation Index (SCI). The SCI algorithm exploits the fact that SCI stopwords in a search phrase function as placeholders. Such a phrase serves effectively as a fixed adjacency condition determined by the number n of adjacent stopwords (i.e., retrieve all records where word A and word B are separated by n words in at least one location). The algorithm integrates over search phrases with different numbers of adjacent stopwords to provide a flexible adjacency or proximity capability (i.e., retrieve all records where word A and word B are separated by n or fewer words in at least one location, where n is the maximum separation desired between A and B). The Google algorithm exploits the fact that asterisks (in Google) separating words in a phrase function like word wildcards. The difference between two such phrases (the first phrase containing one fewer asterisk than the second phrase) serves effectively as a fixed adjacency or proximity condition, with the number of separating words equal to the number of asterisks in the first phrase. The algorithm integrates over these phrase differentials to provide the same flexible adjacency or proximity capability: it retrieves all records where word A and word B are separated by n or fewer words in at least one location.

Report date: 01 FEB 2006
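
The two query constructions described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the choice of "the" as the placeholder stopword, the OR joining of SCI phrases, and the use of a leading minus sign to express the Google phrase difference are all assumptions, and current versions of either search service may not treat stopwords or asterisks the way the abstract describes.

```python
def sci_fixed_phrase(word_a, word_b, n, stopword="the"):
    """Phrase in which n stopwords stand in for n words between A and B.

    The stopword "the" is an assumed placeholder, not taken from the source.
    """
    if n < 0:
        raise ValueError("n must be non-negative")
    return '"' + " ".join([word_a] + [stopword] * n + [word_b]) + '"'


def sci_proximity_query(word_a, word_b, max_sep, stopword="the"):
    """OR together the fixed-separation phrases for 0..max_sep separating words."""
    return " OR ".join(
        sci_fixed_phrase(word_a, word_b, n, stopword) for n in range(max_sep + 1)
    )


def google_phrase(word_a, word_b, k):
    """Phrase with k asterisks (word wildcards) between A and B."""
    if k < 0:
        raise ValueError("k must be non-negative")
    return '"' + " ".join([word_a] + ["*"] * k + [word_b]) + '"'


def google_differentials(word_a, word_b, max_sep):
    """One query per separation k: the k-asterisk phrase minus the (k+1)-asterisk phrase.

    The "-" exclusion syntax is an assumption about how the difference is expressed.
    """
    return [
        f"{google_phrase(word_a, word_b, k)} -{google_phrase(word_a, word_b, k + 1)}"
        for k in range(max_sep + 1)
    ]


if __name__ == "__main__":
    print(sci_proximity_query("carbon", "nanotube", 2))
    for query in google_differentials("carbon", "nanotube", 1):
        print(query)
```

In this sketch the SCI case yields a single combined query, while the Google case yields one query per separation value whose result sets would then be merged; that split is a design choice made here for clarity, not something the abstract specifies.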

Similar resources

Investigating the Effect of Spatial Proximity on Iran University- Industry Co-publications by using Gravity Model

Background and Aim: Given the importance of scientific relations between university and industry, it is important to identify the factors that affect these relations. The aim of this study is therefore to investigate the effect of spatial proximity on university-industry collaboration. The collaboration indicator used here is university-industry co-publications. Methods: The research is...

A Proposed Scheme for Remedy of Man-In-The-Middle Attack on Certificate Authority

Australian Business Deans Council (ABDC); Bacon’s Media Directory; Cabell’s Directories; Compendex (Elsevier Engineering Index); CSA Illumina; DBLP; Gale Directory of Publications & Broadcast Media; GetCited; Google Scholar; INSPEC; JournalTOCs; Library & Information Science Abstracts (LISA); MediaFinder; Norwegian Social Science Data Services (NSD); SCOPUS; The Index of Information Systems Jou...

Communication competencies of nursing managers: A review study

Introduction: Nursing managers have extensive relationships within the organization, and communication is the key to the working processes and relationships needed to achieve organizational goals. Nursing managers should therefore be aware of the communication process and of communication competencies. Communication competence is a set of knowledge, skills and attitudes that reflect job performance and can be as...

Nursing Inter Professional Communication Challenges: A systematic Review

Introduction: Communication is essential to carrying out nursing care and treatment. Nurses communicate extensively with colleagues in the nursing profession, with patients and families, and with other professions; this communication is necessary for teamwork, and maintaining these relationships is essential. Being aware of the challenges of inter-professional communica...

Journal title:
  • J. Information Science

Volume 32, Issue

Pages -

Publication date: 2006